Method for preparing and using personal and genetic profiles

ABSTRACT

Method and system for preparing a personal genetic profile includes collecting genetic data from an individual, assigning the data to a coordinate system, storing the data, and providing access for retrieval by the individual from whom the genetic data were collected, after receipt of an Identifier that adequately authenticates the identity of the data requester. Locations of genetic markers are provided as three-dimensional coordinates, described with matrix relationships that are consistent with the primary and secondary chemical structure of molecular constituents of a DNA chain for the individual.

This invention claims the benefit of provisional application No.60/235,434, filed Sep. 26, 2000.

FIELD OF THE INVENTION

This invention broadly relates to genetic testing, and more particularlyrelates to providing results of private and anonymous genetic testingthat can be securely accessed on a network.

Most information about advancements in the biotechnology industry andthe mapping of the human genome is written for scientists or medicalprofessionals and is difficult for lay people to even read, let aloneunderstand. Consumers know that these topics are relevant and important,but consumers are uncertain how these topics impact their lives and donot have easy access to the answers. Most importantly, when consumers dobecome aware that genetic testing is an important tool for managingone's health care and making life decisions, customers are wary of suchtesting for fear that access to their genetic information will result indiscrimination in medical insurance, employment education, and housing.

What is needed is a system that provides results of genetic testing thatcan be accessed only through presentation and authentication of aconfidential identifier or key that is optionally known only to the dataowner (e.g., subject of the testing).

SUMMARY OF THE INVENTION

In one aspect of the invention, a method for preparing a personalgenetic profile includes collecting genetic data from an individual,assigning the data to a coordinate system, storing the data, andproviding access for data retrieval by the individual from whom thegenetic data were collected, if the individual is an adult, or from theindividual's guardian or parent if the individual is a minor or has alegally appointed representative. The stored data are preferablyretrieved through a database and network and may be visually examined bythe individual.

In a preferred embodiment for practicing the invention, one collects aDNA sample from oneself, preferably in a non-invasive fashion such as bymeans of an inner cheek swab, saliva sample, fingernail or hairclippings, and the like. A sample submission kit may be provided forcollecting the sample. The kit preferably includes instructions fordelivering the sample to a test facility and may also provide anidentifier for the sample. Identification may include use of a bar code,serial number, or password. The kit also includes the ability to specifythe gene (or disorders associated with genes) for which the user wishesto be tested. The test facility isolates the DNA, standardizes andarchives it, and performs the specified tests. The results of therequested tests are stored in machine-readable form together with one ormore unique user identifiers, such as a serial number and password or abiometric indicium. The results of the tests are preferably converted toa vector format having two or three location coordinates. The user isthen able to utilize a network, provide the user's serial number andauthentication indicium, and access his or her data, which can beinteractively visualized using the mapping coordinates and vector formatof the data.

Visualization of the data preferably utilizes Geographic InformationSystems (“GIS”) software. Thus, the information of an individual'sgenetic profile displayed on a secured database can be displayed inmultiple views representing different levels of complexity, from a macroview of the 23 pairs of human chromosomes to a micro view at the levelof linear array of individual nucleotides or DNA bases. An individualcan click on particular markers to link direct to information about thatmarker, or information about diseases or conditions linked to theindicated marker, and obtain latest industry information, including genetherapies, that are available to work with that marker.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 schematically illustrates a system for practice of the invention.

FIG. 2 illustrates a sequence of layers of response to a user's query inpracticing the invention.

FIG. 3 illustrates a diagram indicating relevant genetic markers.

FIG. 4 illustrates a particular human chromosome and some geneticmarkers associated with that chromosome.

FIGS. 5 and 6 illustrate spatial relationships of consecutive molecularconstituents in a chain to each other.

DESCRIPTION OF BEST MODES OF THE INVENTION

Practice of the invention preferably begins with the use of a collectionkit by which the user collects a DNA sample 13 from himself or herselfin a non-invasive fashion. With reference to FIG. 1, a user 11 indicatesthe gene or disorder for which testing is desired, and the systemcompares selected characteristics associated with one or more genemarkers of the user with corresponding characteristics for the same genemarker(s) as determined from a reference database 15. Optionally, thisincludes statistics and/or confidence levels associated with differentexpressions of a particular characteristic associated with one or moresuch genes. Optionally, part or all of the user's own data areselectively and anonymously added to the reference database 15.

The user 11 requests information on, and a comparison of, a specifiedorgan, tissue, regulatory system or other medical feature, or specifiesa particular malady or disease. This may be accomplished by enteringqueries and/or data in to a Map Guide Server 17, using a keyboard orother data/command entry mechanism 19 and associated viewer 21 thatcommunicate with the Server through an HTML page with embedded Map Guide23 and a secure Internet connection 25. In response to receipt of theuser's query or command, the Map Guide Server queries the database 15,and determines and assembles the relevant data for comparison with theuser's data and, optionally, for estimation of confidence levelsassociated with different plausible results or conclusions, based on thedata presented by the user. An array of user data associated with one ormore gene markers may be consistent with one or more conclusions, witheach such conclusion having its own confidence level(s).

FIG. 2 is a schematic view of several layers of response to a user'squery. At a first (highest) layer 31, a user interface receives theuser's data/command, optionally in response to user answers or responsesto a sequence of questions posed by an expert system or other userinterrogation system. At a second layer 33, the system analyzes the useranswers and responses and determines if the reference database (13 inFIG. 1) has any relevant information. For example, if the user's queriesand commands concern a neuropathological response to a little knownsubstance or to a rare and acute health event, the system may advise theuser that this database, at present, has no information that is relevantto the user's problem.

If the system determines, from the user's answers and responses, thatsome information contained in the database is relevant to the user'sproblem, the reference database is now queried, at a third layer 35, toassemble the relevant information. At a fourth layer 37, the systemreceives and analyzes the user data and compares the user data withcorresponding gene marker data or other data and determines confidencelevels for one or more results or consequences that are relevant to theuser data presented.

Optionally, access to the fourth layer 37 requires presentation ofauthentication of the identity of the requester, using one or more of apassword, a physical or electronic token, and one or more biometricindicia. The fourth layer 37 is optionally divided into two or moresub-regions 37-k (k=1, 2, . . . ), as illustrated in FIG. 2, which maybe overlapping or may be mutually exclusive, with access to eachsub-region requiring presentation of different authentication and beingavailable to a different group of one or more physicians and health careprofessionals. Access to a particular sub-region in the fourth layerrequires a “need to know” the particular information contained in thatsub-region. For example, an osteopathic physician may have no need toknow details concerning effects of a psychotropic drug currently beingtaken by the owner of the data in the fourth layer. In one embodiment,only the owner of the data, as a “super user” has access to allsub-regions within the fourth layer 37.

More than 740 disorders associated with specific genetic profiles havealready been identified. Initially, testing is provided for selectedmarkers, for example, Fragile X and Hemochromatosis, and a database ofidentified and important markers is accumulated. As data on eachadditional marker are added to a reference database, the correspondingmarker in a patient can be examined and compared with information inthis database.

Fragile X is a genetic condition that results in learning disabilitiesin children. It is very useful for parents to know about the presence ofthis condition as early as possible so that the parents can plan for theextra tutoring and educational attention that such a child will require.Fragile X is being considered for mandatory testing at birth and isalready widely accepted as an important, non-controversial test.

Hemochromatosis, a common genetic disorder in the United States, causesthe body to accumulate too much iron, a condition that is typicallyoverlooked or misdiagnosed but can lead to serious liver damage andother health complications if left unchecked. The advantage of testingfor this condition genetically is that is remarkably easy to prevent:individuals need only to donate blood regularly to avoid the onset ofsymptoms and to maintain health.

Other illustrative conditions for which tests can be conducted includethe following disorders which are associated with specific geneticprofiles: colon cancer, osteoporosis, glaucoma, cataracts, breastcancer, melanoma, diabetes, prostate cancer, hypercholesterolemia,dyslexia, and malignant hyperthermia.

Further, if one wishes to have a genetic “fingerprint,” for example, ofone's child or children, a test employing about 100 DNA markers,distributed randomly throughout the genome, can be performed. Thesecommon markers, referred to as single-nucleotide polymorphisms or SNPs(pronounced “snips”), are present at about one site in a thousand in theDNA in a human genome. These random markers provide a cost-effectiveapproach to map on a DNA sample with current technologies. One hundredmarkers of moderate frequency in the population would uniquely identifyevery individual on Earth, with virtually no chance of misidentificationor confusion.

Because the amount of DNA obtained from a typical specimen is sufficientto perform several individual gene marker tests, the user likely willnot need to resubmit a DNA sample if he or she desires additionalgenetic tests.

The genetic testing can be performed by various assay technologies knownand used by persons skilled in the art. As one example, Sequenom, basedin San Diego, Calif., has developed single-nucleotide polymorphismgenotyping technology that is based on MALDI-TOF mass. Another exampleis Orchid Bioscience based in Princeton, N.J., which offers an assaycalled genetic bit analysis (GBA). Another example is Amersham PharmaciaBiotech, based in Cardiff, United Kingdom, which has an assay calledrolling circle amplification from Molecular Staging, located inGuildford, Conn. Each of these companies is developing an integratedinstrumentation package to support high throughput assays. Included inthe Amersham package is an integrated lab information management system(LIMS) and Oracle database, optionally including bar code readingcapability. Several reliable commercial technologies are available formeasuring SNPs in a cost-effective manner.

Depending on the tests to be performed, there will be a varying numberof genotypes required. Testing for Hemochromatosis, for example, willlikely require examination of about eight genotypes.

An important aspect of one embodiment of the invention is that theindividual submitting the sample will have sole access to his or hergenetic information (or, in the case of a parent, to the dependentchild's information), as well as control over its disclosure, use and/ordisposition. Optionally, and within reasonable limits, this individualwill determine whether a health record concerning this individual willbe retained or deleted/destroyed.

In one embodiment, an Identifier can be a four color, two-dimensionalarray of uniform objects, such as triangles, rectangles, polygons,circles or ovals, that will represent an individual's genetic identity.The array dimensions may be 14×14 objects (more generally, with M1×M2objects), or a total of 196 array objects. These 196 array objects,arranged in a selected pattern, will represent the two alleles (orgenetic variants) at 98 distinct genetic loci that vary at a single DNAbase in the human population. Genetic markers of this case are referredto as SNPs. The color of each array object can indicate a genotype orallele. For example, yellow can represent guanine base or G, blue canrepresent cytosine base or C, green can represent adenine base or A, andred can represent thymine or T.

A selected number N genetic markers (e.g., N=8), preferably in an agreedorder, is established. In one embodiment, the markers in the marker setmeet the following criteria: (1) no two markers can be linked by morethan 10 centimorgans genetic distance, assuming a total of 3000centimorgans genetic distance for the entire human genome; (2) nomarkers from the X or Y chromosome are included, to avoid genderidentification being implicit in use of an Identifier; (3) all markersare bi-allelic; any markers known to be tri-allelic or tetra-allelicwithin the general population are preferably excluded from the markerset; and (4) the average frequency of the major allele of all N locidoes not exceed 75 percent in the general human population. A marker setof this specification can uniquely identify one in over three trillionindividuals.

The 14×14 object array (or, more generally, M1×M2 object array)represents consecutively the two alleles of the N markers in anestablished order described above. In the case of heterozygous genotypesof any marker, the alleles are optionally indicated in alphabeticalorder (i.e., AT, AC, AG, CG, CT, and GT).

The 14×14 object array of colored uniform objects may be used as avisual unique identifier, and it may be stylized and personalized by itsowner as a personal logo. The array may be digitized and represented asan array of characters (e.g., A. C, G and T) or numbers (1, 2, 3, . . ., M), and thus may also be used as an electronic identifier for internetsecurity purposes. The Identifier is preferably part of the securitymechanism.

FIG. 3 is a diagram indicating a group of genetic markers that arerelevant to a particular user's query on a disease or malady, organ ortissue or regulatory system, or other health- or medical-related issue.The genetic markers may be displayed in an ordered linear array or as anordered array on a two-dimensional surface, and the relevant markers aredisplayed in a color or texture that is distinguishable from theremaining markers.

FIG. 4 illustrates a particular human chromosome (No. 19 in thisFigure), with small squares indicating the location of genetic markers(for parents P1 and P2) that are relevant to a particular query fromeach parent. The differently shaded bands in FIG. 4 indicate genelocations that “code” for susceptibility to different maladies,including the following: colon cancer; factor V Leiden mutation;hemochromatosis; and malignant hyperthermia; and melanoma.

A key feature in any security architecture is its access controlmethodology. This mechanism controls who has access to what data and,when combined with privilege allocation, what the recipient can do withthat data. Establishing access for users involves user authenticationthat is a means of verifying the user's asserted identity.Authentication can be single factor or, preferably, multi-factor. Afactor is preferably something known to the user (a user ID andpassword), something the user possesses (a token), and/or one or morebiometric indicia (fingerprints, iris scans, retinal scans, facialpatterns, vein patterns, voiceprints, rapid DNA analysis, handwritinganalysis and recognition, etc.). Authentication is improved whenmultiple authentication factors are used together.

The use of the Identifier as part of an authentication processrepresents a unique opportunity to combine token systems withbiometrics. This combination provides ease of use and increased securitysystem performance in that fewer authentication process steps arerequired for higher levels of trust. Further, the Identifier representsa unique form of identifier for the individual. Preferably, theidentifier is rendered in a binary format so that can be transmitted toa personal digital assistant (PDA) device for performing portable and/orremote identification.

Once the testing has been performed, data are processed in accordancewith the invention to allow the genetically profiled user to access andnavigate through visual spatial representation of the data.

The graphic representations of the user's genetic profile data, and aphysical location in the human body associated with such data, can begenerated and displayed using geographic information systems (GIS)technology, whereby a vector graphic is programmatically linked to arecord in a relational database. Examples of such technology areAutodesk MapGuide software or Oracle Spatial 8i. The software userinterface will be modified for genetic and anatomical terminology.

A user can pan and zoom through a visual representation of the variouschromosomes, genes, SNPs, or nucleotide sequences. The user can doubleclick on any of these entities to initiate a selected event, such aslaunching a URL with additional text information or display of a visualrepresentation of how the genetic entities correlate with effects indifferent organs, tissues and regulatory systems of the body.

Genes and genetic markers have three-dimensional coordinates. However,when the human genome was mapped, the sequence of nucleotides (primarystructure) is specified but not the actual physical, three-dimensionallocation of each nucleotide (secondary and tertiary structure). Unlessand until the actual physical, three-dimensional locations of thenucleotides are determined or provided, the present invention assignsgenetic markers and genes to an (x,y) or (x,y,z) coordinate system. In apreferred embodiment this can be done by creating a map that specifiesthe physical location of each chromosome (based on a photomicrograph ofa human cell in metaphase) and places the sequence information for eachchromosome beneath that image. This is a simplification of where eachnucleotide resides, in relation to others, but still allows for somespatial analysis, and more importantly provides a mechanism fordistributing and managing an otherwise cumbersome dataset.

At this point, one has an overview map (photograph) to scale based onthe amount of sequence data that lies beneath the image. The image isspatially referenced and scaled using a Tiff world file format; theraster image is later used as a background image within the MapGuideapplication.

The length of the longest chromosome, chromosome 1, is estimated to be278,691,924 base pairs (mapping units). One can estimate the extent ofthe image to be 4.75× the transverse diameter of chromosome 1 across, or1,323,786,639 units across. Where a representative is 360×365 pixels,the transverse resolution would be 3,677,185 units per pixel. On thatbasis one can create a .TFW file to spatially reference the image. The.TFW file is an ASCII text file containing at least six lines ofinformation:

Line 1: X resolution dimension of a pixel in map units (1 unit=1 basepair) in the X direction=3,677,185.

Line 2: Amount of translation. (standard value zero)

Line 3: Amount of rotation.

Line 4: Negative of the Y resolution dimension of a pixel in map units(1 unit=1 base pair) in the Y direction.

Line 5: X coordinate of the center of pixel 1.1(upper-left)=5*3,677,185=1838593.

Line 6: Y coordinate of the center of pixel 1,1(upper-left)=(365*3,677,185)−1838593=1340333972.

In order to spatially-reference a photomicrograph image of a somaticcell, specification of six numbers is required. An example is:

+3677185.00

−00.00

−00.00

−3677185.00

1838593.00

1340333972

A .TFW file has the same name as the TIFF file it references and islocated in the same folder as the source file. An example is:

photomicrograph.tif

photomicrograph.tfw

Next, the spatially referenced image is imported into a program such asAutocad Map and a baseline vector line segment is drawn down the middleof each chromosomes, starting from the p arm telomere of the chromosome;the direction of the line segment is of significance for subsequenttopological analysis. One then calculates the relative displacement forthe genetic loci values for the chromosomes so that a locus correspondsto a line segment that extends beneath the visual outline for eachchromosome, and the sequence data for the corresponding chromosomesegment are laid on top of that line.

The x and/or y and/or z coordinate values generated in certaincalculations, discussed in the following, are exported with loci asrecords in a table which is then linked to it's associated geneticinformation, (i.e. values used for a tooltip, thematic representation orurl for the given genetic marker).

A sample table for an ODBC data source is as follows

Key Disease Gene Chromosome Starting_BasePair URL Secondary X SecondaryY NM 000410 Hemochromatosis HFE 6 29165296 http://www.any.html 956587139724196265 NM 000038 Colon Cancer APC 5 119602162 http://www.any.html785286552 1147943203 NM 014885 Colon Cancer APC10 4 154750645http://www.any.html 457943103 667513431 NM 016237 Colon Cancer APC5 12133077128 http://www.any.html 1217584431 452284552 NM 005883 ColonCancer APCL 19 587139 http://www.any.html 311487705 954750545 NM 001639Colon Cancer APCS 1 179676945 http://www.any.html 719652162 863777528 NM000251 Colon Cancer MSH2 2 48943103 http://www.any.html 2611353631349376545 NM 000077 Melanoma CDKN2A 9 24196215 http://www.any.html679676345 829168256

These tabular data above are used to create the .sdf layers in theMapGuide application and thematically displayed by diseases. Values usedto create the MapGuide ODBC SDF layer are as follows:

Key: Key Column

Tooltip: Gene

X: Secondary X value

Y: Secondary Y value

By combining the spatially referenced raster image with the ODBC datasource vector data, the user can navigate within and perform spatialqueries on a physical representation of the genetic data.

Once the (x,y,z) coordinates have been determined for a series ofmarkers, a translator can be created, using the standard uniqueidentifiers for the various genetic markers (e.g., from the publiclyavailable NIH datasets). As a result, the process of automating thevisual display of massive genetic datasets can be achieved with minimaltime and effort.

Large genetic data sets can be managed via various sub-layers (e.g.,part of layers 3 and 4 in FIG. 2) that turn on and off at different panand zoom scales to provide an optimal amount of detail at each scale.Additionally, different sub-layers of the representation can be pulledfrom an unlimited number of servers to optimize performance andscalability.

The genetic information on the site can be accessible from a plurality,preferably at least three, of different, interconnected mechanisms:standard text, a visual representation of the human body, and a visualrepresentation of physical genetic entities (i.e., a cell withchromosomes, genes, protein expressions, SNP markers, and nucleotidesequences).

Information distribution mechanisms include the following.

1. Standard Text Query. Here standard text input or pull-down menucontrols launch an SQL query for genetic information as a basis ofdisease, key words or physical location on the body or chromosomes.

2. Query Generated from Representation of a Human Body. A visualrepresentation of the human body, whereby the user is able to pan andzoom down through the physical representation and to double click on oneor more components of the body to trigger display of additional geneticinformation (provided in text form or in the form of a visualrepresentation of the genes and chromosomes in the body that correlateto that physical location on the body).

As illustrations, such information mechanisms would interact as follows.If a user wishes to know about Hemochromatosis (HFE), the user selectsHFE from a pull down menu on diseases. This causes the system to submita query for a graphic representation of the human body and/or of humangenes.

In a graphical representation of the human body, those areas affected bythe specified disease are highlighted, identified via tool tip on amouse over an event and each graphic entity capable of launching a URLwith additional information on the disease. Data on a human body areavailable from the NIH's website. The detailed drawings of the body areconverted to JPEG (compressed) images with the resolution optimized fordifferent zoom scales. Organs, tissues, regulatory systems, and physicalfeatures have a vector outline with transparent fills so that thegraphics are interactive but artistic renderings.

For graphic representation of a cell, the genetic information containedin the chromosomes affected by a specified disease are highlighted,identified via tool tip on a mouse over event, with each graphic entitybeing capable of launching a URL with additional information on thedisease. Different types of information would be available at differentzoom scales (chromosome down to the level of nucleotide sequence andallelic variants). Additionally, using the MapGuide API, a series oftools can be created to enable the user to perform topological analysis(the ability to draw a polygon to select a group of objects with thekeys from the selected objects passed on to generate a report).

For MapGuide, the applications depicting the human body and/or humangenes are preferably part of a special mime type .mwf file that isembedded in the frame of the html as follows:

-   -   <OBJECT ID=“myMap”WIDTH=300 HEIGHT=200    -   CLASSID=“CLSID:62789780-B744-11D0-986B-00609731A21D”>    -   <PARAM NAME=“URL” VALUE=“http://www.dominga.com/maps/0244.mwf”>    -   <EMBED SRC=“http://www.dominga.com/maps/0244.mwf” NAME=“myMap”        WIDTH=300 HEIGHT=200>    -   </OBJECT>

This .mwf file is passed by the network server to a MapGuide agent,which runs as a separate service on top of an NT server. The .mwf fileincludes pointers to different layers of information that are availablefrom any networked server, thus providing extended scalability. In thecase of Oracle Spatial 8i, the links between the graphic entities andrelational database would be set up using JDBC. The raster imageinformation can, for example, be served up using Lizard compressiontechnology with .ric files in MapGuide. In the case of Oracle, globs areused to serve up the background renderings for the images.

One feature of the software applications security mechanism is theability of the data owner (the individual submitting the sample) to setup time-sensitive user accounts to share the owner's medical informationwith his/her doctor, etc. as they determine to be in their ownself-interest. In such cases, user account privileges can be set upwithin MapGuide and tied to either zoom scales, layers or at the vectorentity level, in addition to normal security functionality present atthe network, web server and data source level (i.e., as provided byOracle).

For example, by creating a series of pull-down menus using Cold Fusionor ASP to trigger events tied to the MapGuide API (i.e., turning on andoff layers of data and passing the access key for only the visibleobjects or layer), the genetic data owner can customize the securityrelated to his/her personal data.

Upon log-in, MapGuide will prompt a physician or other health careprofessional for an access code in order to view a limited dataset (asspecified or limited by the end user). Communication between a specificuser and the data owner regarding the information is preferably handledvia an anonymous e-mail account.

A subsequent and related feature of the software security mechanism isthe ability of the data owner to specify that his/her personal data bedestroyed (either in part or in entirety). Hence, as described above,the user can specify layers information or specific markers for whichthe user wishes selected personal data to be destroyed or deleted aspart of the MapGuide interface.

A Process for Calculating (x,y,z) Coordinates

Consider a DNA (or RNA) chain, illustrated in FIG. 5, that contains oneor more representatives of chromosomes as a linear chain CH of N+1molecular constituents, such as nucleotides with an associatedsugar-phosphate backbone (the backbone is ignored as part of the linearchain structure, except for its effects on the secondary and tertiarystructure, discussed below). Assume that a free end of this chain CH(e.g., location of a telomere or other selected atomic or molecularconstituent) is located at an origin O of a Cartesian coordinate systemand that any two consecutive molecular constituents, numbered n and n+1(n=1, . . . , N) in the chain are connected by aconstituent-to-constituent bond of length L_(n). Each link n isinitially aligned parallel to a z-axis of the coordinate system. Theprimary structure of the chain CH is that of N+1 molecular chainconstituents, joined end-to-end and aligned parallel to a selected axis(e.g., the x-axis), with constituents n and n+1 being joined by a linkof length L_(n).

One by one, beginning with the constituent numbered n=1, the initialalignment is relaxed so that each link assumes an orientation relativeto the preceding link that is determined by theconstituent-to-constituent individual and collective interactions, toproduce a secondary structure (and possibly a tertiary structure and/ora quaternary structure) that may have a helical or another substantiallylinear structure. Relative to a direction defined by the orientation oflink number n (considered as a local x-axis), the normalized coordinates(x_(n-1),y_(n-1),z_(n-1)) with x_(n-1) ²+y_(n-1) ²+z_(n-1) ²=1, of thesecond end (far end) of link number n+1 relative to the near end of thislink are given by a matrix relationship:X _(n-1) =R(φ_(n-1),θ_(n-1))X _(n),  (1)

$\begin{matrix}{{X_{n} = {❘{\begin{matrix}x_{n} \\y_{n} \\z_{n}\end{matrix}❘}}},} & (2)\end{matrix}$

$\begin{matrix}{{R\left( {\phi_{n},\theta_{n}} \right)} = {{\begin{matrix}{\cos\;\phi_{n}\cos\;\theta_{n}} & {\sin\;\phi_{n}\cos\;\theta_{n}} & {\sin\;\theta_{n}} \\{{- \sin}\;\phi_{n}} & {\cos\;\phi_{n - 1}} & 0 \\{{- \cos}\;\phi_{n}\sin\;\theta_{n}} & {{- \sin}\;\phi_{n}\sin\;\theta_{n}} & {\cos\;\theta_{n}}\end{matrix}} = R_{n}}} & (3)\end{matrix}$This orientation is illustrated in FIGS. 5 and 6. The non-normalizedcoordinates of the second end of link n+1, relative to the originalcoordinate system with origin O then becomeX′ _(n-1) ={L ₁ PR(1)+L ₂ PR(2,1)+L ₃ PR(3,2,1)+ . . . +L _(n-1)PR(n+1,n, . . . ,1)}X _(n),  (4)PR(n+1,n, . . . ,1)=R(φ_(n-1),θ_(n-1))PR(n,n−1, . . . ,1),  (5)PR(1)=R(φ₁,θ₁).  (6)The link lengths L_(n) may be, but need not be, the same. For example,it is generally accepted that the distance between two consecutive basepairs in a DNA helix is about 0.34 nanometers (nm), under normalcellular conditions of temperature and ionic strength, but this numberrepresents an average or mean value, not a fixed and immutable distance.

Computation of the coordinates of X′_(n-1), given a knowledge of theangles φ_(k) and θ_(k) (k=1 . . . , n+1), is straightforward but timeconsuming, even where the orientation angles φ_(k) and θ_(k) are allknown or prescribed. This task can be made simpler by the followingapproach: for a sequence of spaced apart integers n=n1, n2 (>>n1), n3(>>n2), . . . , compute and store the coordinates X′_(n) for n=n1, n2,n3, . . . For any integer m satisfying nq<m<n(q+1), use the storedquantities X′_(nq) and Eqs. (4) to generate X′_(m), beginning with theknown quantity X′_(nq). The integer differences n(q+1)−nq can be assmall as 5–10, or some larger range of numbers, such as 20–100 or200–800 or 500–1000 or larger. These integer differences need not beuniform.

The link-to-link polar angles θ_(k) (θ<θ_(k)<π) can be treated asapproximately fixed, with a small angular variation, but the azimuthalangles φ_(k) may range over a full cycle (0<φ_(k)<2π) subject to aprobability density function f(φ_(k); θ_(k); k) that reflects well knownstereochemical angle hindrances that arise from neighbor-to-neighborinteractions between adjacent molecular constituents. For example, ifthe energy of interaction between the molecular constituents number k−1and k is E(φ_(k);θ_(k) k), with θ_(k) approximately fixed, theprobability density function can be approximated asf(φ_(k);θ_(k) ;k;T)=exp{−Eφ _(k);θ_(k) ;k)/k _(B) T}/{∫exp{−E(φ′;θ_(k);k)/k _(B) T}dφ′},  (7)where k_(B) is the Boltzmann constant and the density function f nowdepends also depends upon local temperature T. As the temperature Tincreases, the density function f may approach a function that hasuniform amplitude in the azimuthal angle φ_(k).

As a first approximation, the average geometry of a DNA helix can beadopted, having ten nucleotides per full turn (φ_(n-1)−φ_(n)=36°), and abond angle θ_(n) and bond length L_(n) that are related by L_(n) sinθn=0.34 nm. This first approximation can then be varied by imposingconstraints that are determined from visual observations or diffractionmeasurements for a particular user.

1. A method for providing a personal genetic profile, comprising:providing genetic data for an individual, wherein the genetic datacomprises associated attribute information; and mapping the genetic datausing geographic information systems to spatially represent the geneticdata in a thematic map based on the attribute information; providingdata for at least one genetic marker associated with the individual;representing the at least one genetic marker at at least one location inthe thematic map wherein the representation of the genetic markercomprises genetic marker data in alphanumeric format at a locationadjacent to the genetic marker location; providing a general database,including a least one feature associated with the at least one geneticmarker, for a general population of individuals; and receiving anIdentifier associated with the individual, (1) using the Identifier toidentify the individual, displaying at least one of (i) a visualrepresentation of a selected portion of the individual's body indicatingthe at least one location for the at least one genetic marker, (ii) datafor the at least one genetic marker associated with the individual,(iii) selected data on at least one disease associated with the at leastone genetic marker, and (iv) data on at least one regulatory systemassociated with the at least one genetic marker, and (2) comparing dataon the at least one feature associated with the at least one geneticmarker from the general database with the displayed data for theindividual.
 2. The method of claim 1, further comprising including insaid Identifier at least one biometric indicium for said individual. 3.The method of claim 2, further comprising drawing said biometricindicium from a group of biometric indicia associated with saidindividual and consisting of; at least one fingerprint, at lease oneiris scan, at least one retinal scan, at least one facial pattern, atleast one blood vein pattern; at least one voice print of saidindividual's voice; and at least one DNA analysis of a biometric samplereceived from said individual.
 4. The method of claim 1, furthercomprising including in said Identifier a selected token associated withsaid individual.
 5. The method of claim 1, further comprising includingin said Identifier a selected password associated with said individual.6. The method of claim 1, further comprising; providing said generaldatabase in a first data layer to any data user; providing said geneticmarker data in a second data layer; and providing access to saidindividual's genetic marker data only upon presentation of saidIdentifier.
 7. The method of claim 1, further comprising; providing saidgeneral database in a first data layer to any data user; providing saidgenetic marker data in a second data layer having at least a selectedsecond data layer first sub-region and a selected second data layersecond sub-region; providing access to said individual's genetic markerdata in the second layer first sub-region only upon presentation of afirst type of said Identifier; and providing access to said individual'sgenetic marker data in the second layer second sub-region only uponpresentation of a second type of said Identifier.
 8. The method of claim7, further comprising configuring said second layer second sub-region ofsaid genetic marker data to partially, but not fully, overlap saidsecond layer second sub-region of said genetic marker data.
 9. Themethod of claim 7, further comprising configuring said second layerfirst sub-region of said genetic marker data and said second layersecond sub-region of said genetic marker data to have substantially nooverlap.
 10. The method of claim 1, further comprising representing saidat least one genetic marker at at least one location in a coordinatesystem having at least two dimensions.
 11. The method of claim 1,further comprising representing said at least one genetic marker at atleast one location in a coordinate system having at least threedimensions.
 12. The method of claim 1, wherein said process ofrepresenting said at least one genetic marker at said at least oneselected location comprises; providing a primary chemical structure fora sequence of molecular constituents that are part of a DNA chainrepresenting a biological makeup for said individual and including theselected location; providing a secondary chemical structure for thesequence of molecular constituents that are part of the chain; providingat least one value of a bond length between two consecutive molecularconstituents of the chain and at least one value of a bond angleassociated with two consecutive bonds between consecutive molecularconstituents of the chain, where the at least one bond length and the atleast one bond angle are consistent with the primary and secondarychemical structure of said molecular constituents that are part of thechain; and providing a quantitative matrix relationship that describesthree-dimensional coordinates for each molecular constituent that ispart of the chain relative to a preceding molecular constituient in thechain and that is consistent with the primary and secondary chemicalstructure of said molecular constituents that are part of the chain. 13.The method of claim 12, wherein said process of providing said matrixrelationship comprises representing said three-dimensional coordinatesX′_(n+1)=(x_(n-1), y_(n-1), z_(n-1)) for location of a molecularconstituent number n+1 in terms of said three-dimensional coordinatesX′_(n)=(x_(n),y_(n),z_(n)), for location of a preceding molecularconstituent number n in said chain by a relationshipX′ _(n-1) =L _(n) R(φ_(n),θ_(n))X _(n),${{R\left( {\phi_{n}\upsilon_{n}} \right)} = {\begin{matrix}{\cos\;\phi_{n}\cos\;\theta_{n}} & {\sin\;\phi_{n}\cos\;\theta_{n}} & {\sin\;\theta_{n}} \\{{- \sin}\;\phi_{n}} & {\cos\;\phi_{n - 1}} & 0 \\{{- \cos}\;\phi_{n}\sin\;\theta_{n}} & {{- \sin}\;\phi_{n}\sin\;\theta_{n}} & {\cos\;\theta_{n}}\end{matrix}}},$ where φ_(n) and θ_(n) are selected aximuthal and polarangles, respectively, associated with the molecular constituent numbern+1.
 14. The method of claim 1 further comprising creating a pluralityof views within the thematic map based on a zoom scale.
 15. The methodof claim 14 further comprising using at least one of the plurality ofviews to create a selection set of genetic markers for exercisingsecurity administration over said genetic data, wherein exercisingsecurity administration over said genetic data comprises destroying thegenetic data.